智能论文笔记

AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data

Amin Banitalebi-Dehkordi , Pratik Gujjar , Yong Zhang

分类：计算机视觉 | 人工智能 | 机器学习

2022-06-14

半监督学习（SSL）在稀缺标记的数据时取得了长足的进步，但未标记的数据丰富。至关重要的是，最近的工作假设这种未标记的数据是从与标记数据相同的分布中汲取的。在这项工作中，我们表明，在存在未标记的辅助数据的情况下，最先进的SSL算法在性能下遭受了降解，这些数据不一定具有与标签集相同的类别分布。我们将此问题称为辅助-SSL，并提出了AuxMix，这是一种利用自我监督的学习任务来学习通用功能，以掩盖与标记的集合在语义上相似的辅助数据。我们还建议通过最大化不同辅助样品的预测熵来正规化学习。当在CIFAR10数据集中培训带有4K标记的样品时，我们在Resnet-50型号上显示了5％的改善，并且从Tiny-ImageNet数据集中绘制所有未标记的数据。我们报告了几个数据集的竞争结果，并进行消融研究。

translated by 谷歌翻译

Local Differential Privacy for Sequential Decision Making in a Changing Environment

Pratik Gajane

分类：机器学习

2023-01-02

We study the problem of preserving privacy while still providing high utility in sequential decision making scenarios in a changing environment. We consider abruptly changing environment: the environment remains constant during periods and it changes at unknown time instants. To formulate this problem, we propose a variant of multi-armed bandits called non-stationary stochastic corrupt bandits. We construct an algorithm called SW-KLUCB-CF and prove an upper bound on its utility using the performance measure of regret. The proven regret upper bound for SW-KLUCB-CF is near-optimal in the number of time steps and matches the best known bound for analogous problems in terms of the number of time steps and the number of changes. Moreover, we present a provably optimal mechanism which can guarantee the desired level of local differential privacy while providing high utility.

translated by 谷歌翻译

Skeletal Video Anomaly Detection using Deep Learning: Survey, Challenges and Future Directions

Pratik K. Mishra , Alex Mihailidis , Shehroz S. Khan

分类：计算机视觉

2022-12-31

The existing methods for video anomaly detection mostly utilize videos containing identifiable facial and appearance-based features. The use of videos with identifiable faces raises privacy concerns, especially when used in a hospital or community-based setting. Appearance-based features can also be sensitive to pixel-based noise, straining the anomaly detection methods to model the changes in the background and making it difficult to focus on the actions of humans in the foreground. Structural information in the form of skeletons describing the human motion in the videos is privacy-protecting and can overcome some of the problems posed by appearance-based features. In this paper, we present a survey of privacy-protecting deep learning anomaly detection methods using skeletons extracted from videos. We present a novel taxonomy of algorithms based on the various learning approaches. We conclude that skeleton-based approaches for anomaly detection can be a plausible privacy-protecting alternative for video anomaly detection. Lastly, we identify major open research questions and provide guidelines to address them.

translated by 谷歌翻译

Privacy-Protecting Behaviours of Risk Detection in People with Dementia using Videos

Pratik K. Mishra , Andrea Iaboni , Bing Ye , Kristine Newman , Alex Mihailidis , Shehroz S. Khan

分类：计算机视觉

2022-12-20

People living with dementia often exhibit behavioural and psychological symptoms of dementia that can put their and others' safety at risk. Existing video surveillance systems in long-term care facilities can be used to monitor such behaviours of risk to alert the staff to prevent potential injuries or death in some cases. However, these behaviours of risk events are heterogeneous and infrequent in comparison to normal events. Moreover, analyzing raw videos can also raise privacy concerns. In this paper, we present two novel privacy-protecting video-based anomaly detection approaches to detect behaviours of risks in people with dementia. We either extracted body pose information as skeletons and use semantic segmentation masks to replace multiple humans in the scene with their semantic boundaries. Our work differs from most existing approaches for video anomaly detection that focus on appearance-based features, which can put the privacy of a person at risk and is also susceptible to pixel-based noise, including illumination and viewing direction. We used anonymized videos of normal activities to train customized spatio-temporal convolutional autoencoders and identify behaviours of risk as anomalies. We show our results on a real-world study conducted in a dementia care unit with patients with dementia, containing approximately 21 hours of normal activities data for training and 9 hours of data containing normal and behaviours of risk events for testing. We compared our approaches with the original RGB videos and obtained an equivalent area under the receiver operating characteristic curve performance of 0.807 for the skeleton-based approach and 0.823 for the segmentation mask-based approach. This is one of the first studies to incorporate privacy for the detection of behaviours of risks in people with dementia.

translated by 谷歌翻译

FedUKD: Federated UNet Model with Knowledge Distillation for Land Use Classification from Satellite and Street Views

Renuga Kanagavelu , Kinshuk Dua , Pratik Garai , Susan Elias , Neha Thomas , Simon Elias , Qingsong Wei , Goh Siow Mong Rick , Liu Yong

分类：计算机视觉 | 机器学习

2022-12-05

Federated Deep Learning frameworks can be used strategically to monitor Land Use locally and infer environmental impacts globally. Distributed data from across the world would be needed to build a global model for Land Use classification. The need for a Federated approach in this application domain would be to avoid transfer of data from distributed locations and save network bandwidth to reduce communication cost. We use a Federated UNet model for Semantic Segmentation of satellite and street view images. The novelty of the proposed architecture is the integration of Knowledge Distillation to reduce communication cost and response time. The accuracy obtained was above 95% and we also brought in a significant model compression to over 17 times and 62 times for street View and satellite images respectively. Our proposed framework has the potential to be a game-changer in real-time tracking of climate change across the planet.

translated by 谷歌翻译

Generalised Spherical Text Embedding

Souvik Banerjee , Bamdev Mishra , Pratik Jawanpuria , Manish Shrivastava

分类：自然语言处理

2022-11-30

This paper aims to provide an unsupervised modelling approach that allows for a more flexible representation of text embeddings. It jointly encodes the words and the paragraphs as individual matrices of arbitrary column dimension with unit Frobenius norm. The representation is also linguistically motivated with the introduction of a novel similarity metric. The proposed modelling and the novel similarity metric exploits the matrix structure of embeddings. We then go on to show that the same matrices can be reshaped into vectors of unit norm and transform our problem into an optimization problem over the spherical manifold. We exploit manifold optimization to efficiently train the matrix embeddings. We also quantitatively verify the quality of our text embeddings by showing that they demonstrate improved results in document classification, document clustering, and semantic textual similarity benchmark tests.

translated by 谷歌翻译

SketchySGD: Reliable Stochastic Optimization via Robust Curvature Estimates

Zachary Frangella , Pratik Rathore , Shipu Zhao , Madeleine Udell

分类：机器学习

2022-11-16

We introduce SketchySGD, a stochastic quasi-Newton method that uses sketching to approximate the curvature of the loss function. Quasi-Newton methods are among the most effective algorithms in traditional optimization, where they converge much faster than first-order methods such as SGD. However, for contemporary deep learning, quasi-Newton methods are considered inferior to first-order methods like SGD and Adam owing to higher per-iteration complexity and fragility due to inexact gradients. SketchySGD circumvents these issues by a novel combination of subsampling, randomized low-rank approximation, and dynamic regularization. In the convex case, we show SketchySGD with a fixed stepsize converges to a small ball around the optimum at a faster rate than SGD for ill-conditioned problems. In the non-convex case, SketchySGD converges linearly under two additional assumptions, interpolation and the Polyak-Lojaciewicz condition, the latter of which holds with high probability for wide neural networks. Numerical experiments on image and tabular data demonstrate the improved reliability and speed of SketchySGD for deep learning, compared to standard optimizers such as SGD and Adam and existing quasi-Newton methods.

translated by 谷歌翻译

An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning

Danil Provodin , Pratik Gajane , Mykola Pechenizkiy , Maurits Kaptein

分类：机器学习

2022-09-08

我们研究了在约束强化学习中有效探索的后验抽样方法。或者，对于现有算法，我们提出了两种简单的算法，这些算法在统计上更有效，更简单地实现和计算便宜。第一种算法基于CMDP的线性公式，第二算法利用CMDP的鞍点公式。我们的经验结果表明，尽管具有简单性，但后取样可实现最先进的表现，在某些情况下，采样明显优于乐观算法。

translated by 谷歌翻译

The Value of Out-of-Distribution Data

Ashwin De Silva , Rahul Ramesh , Carey E. Priebe , Pratik Chaudhari , Joshua T. Vogelstein

分类：机器学习 | 人工智能 | 计算机视觉 | (统计)机器学习

2022-08-23

更多数据有助于我们推广到任务。但是实际数据集可以包含分布（OOD）数据；这可以以异质性的形式出现，例如类内变异性，也可以以时间变化或概念漂移的形式出现。我们在此类问题上展示了一种反直觉现象：任务的概括误差可能是OOD样本数量的非单调函数；少数OOD样品可以改善概括，但是如果OOD样品的数量超出了阈值，则概括误差可能会恶化。我们还表明，如果我们知道哪些样品是OOD，则使用目标和OOD样品之间的加权目标确保概括误差单调减少。我们使用线性分类器在CIFAR-10上的合成数据集和中型神经网络上使用线性分类器演示和分析了此问题。

translated by 谷歌翻译

Riemannian accelerated gradient methods via extrapolation

Andi Han , Bamdev Mishra , Pratik Jawanpuria , Junbin Gao

分类：机器学习 | (统计)机器学习

2022-08-13

在本文中，我们通过推断在歧管上的迭代来提出一种简单的加速度方案，用于利曼梯度方法。我们显示何时从Riemannian梯度下降法生成迭代元素，加速方案是渐近地达到最佳收敛速率，并且比最近提出的Riemannian Nesterov加速梯度方法在计算上更有利。我们的实验验证了新型加速策略的实际好处。

translated by 谷歌翻译